智能论文笔记

Insights into undergraduate pathways using course load analytics

Conrad Borchers , Zachary A. Pardos

分类：机器学习

2022-12-20

Course load analytics (CLA) inferred from LMS and enrollment features can offer a more accurate representation of course workload to students than credit hours and potentially aid in their course selection decisions. In this study, we produce and evaluate the first machine-learned predictions of student course load ratings and generalize our model to the full 10,000 course catalog of a large public university. We then retrospectively analyze longitudinal differences in the semester load of student course selections throughout their degree. CLA by semester shows that a student's first semester at the university is among their highest load semesters, as opposed to a credit hour-based analysis, which would indicate it is among their lowest. Investigating what role predicted course load may play in program retention, we find that students who maintain a semester load that is low as measured by credit hours but high as measured by CLA are more likely to leave their program of study. This discrepancy in course load is particularly pertinent in STEM and associated with high prerequisite courses. Our findings have implications for academic advising, institutional handling of the freshman experience, and student-facing analytics to help students better plan, anticipate, and prepare for their selected courses.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Learning the shape of protein micro-environments with a holographic convolutional neural network

Michael N. Pun , Andrew Ivanov , Quinn Bellamy , Zachary Montague , Colin LaMont , Philip Bradley , Jakub Otwinowski , Armita Nourmohammad

分类：机器学习

2022-11-05

Proteins play a central role in biology from immune recognition to brain activity. While major advances in machine learning have improved our ability to predict protein structure from sequence, determining protein function from structure remains a major challenge. Here, we introduce Holographic Convolutional Neural Network (H-CNN) for proteins, which is a physically motivated machine learning approach to model amino acid preferences in protein structures. H-CNN reflects physical interactions in a protein structure and recapitulates the functional information stored in evolutionary data. H-CNN accurately predicts the impact of mutations on protein function, including stability and binding of protein complexes. Our interpretable computational model for protein structure-function maps could guide design of novel proteins with desired function.

translated by 谷歌翻译

Text Simplification of College Admissions Instructions: A Professionally Simplified and Verified Corpus

Zachary W. Taylor , Maximus H. Chu , Junyi Jessy Li

分类：自然语言处理

2022-09-09

接受高等教育对于少数族裔和新兴双语学生至关重要。但是，高等教育机构用来与准学生交流的语言通常太复杂了。具体而言，美国的许多机构发布录取申请指令远远高于典型高中毕业生的平均阅读水平，通常接近13年级或14年级。这导致学生之间不必要的障碍和获得高等教育。这项工作旨在通过简化文本来应对这一挑战。我们介绍PSAT（专业简化的录取文本），这是一个数据集，其中有112条从美国的高等教育机构中随机选择的录取说明。然后，这些文本将被专业地简化，并被各个机构招生办公室的专职员工专家进行了验证和接受。此外，PSAT带有1,883个原始简化句子对的手动对齐。结果是在与现有简化资源不同的高风险流派中评估和微调文本简化系统的首个语料库。

translated by 谷歌翻译

A Zeroth-Order Momentum Method for Risk-Averse Online Convex Games

Zifan Wang , Yi Shen , Zachary I. Bell , Scott Nivison , Michael M. Zavlanos , Karl H. Johansson

分类：机器学习 | (统计)机器学习

2022-09-06

我们考虑在重复的未知游戏中进行规避风险的学习，在这种游戏中，代理商的目标是最大程度地减少其个人产生高成本的风险。具体而言，代理商使用处于风险的条件值（CVAR）作为风险措施，并以每集选定动作的成本值的形式依靠强盗反馈来估算其CVAR值并更新其动作。使用匪徒反馈来估计CVAR的一个主要挑战是，代理只能访问其自身的成本值，但是，这取决于所有代理的行为。为了应对这一挑战，我们提出了一种新的规避风险的学习算法，并利用有关成本价值的完整历史信息。我们表明，该算法实现了子线性的遗憾，并匹配了文献中最著名的算法。我们为欧洲大师游戏提供了数值实验，该游戏表明我们的方法表现优于现有方法。

translated by 谷歌翻译

Federated Select: A Primitive for Communication- and Memory-Efficient Federated Learning

Zachary Charles , Kallista Bonawitz , Stanislav Chiknavaryan , Brendan McMahan , Blaise Agüera y Arcas

分类：机器学习

2022-08-19

联合学习（FL）是以保护隐私方式在异质客户设备上进行机器学习的框架。迄今为止，大多数FL算法都在多个回合中学习一个“全局”服务器模型。在每回合中，相同的服务器模型都向所有参与的客户端广播，在本地更新，然后跨客户端进行汇总。在这项工作中，我们提出了一个更一般的过程，客户“选择”了发送给他们的值的程序。值得注意的是，这使客户可以在较小的数据依赖性切片上操作。为了使这种实用性，我们概述了原始的联合选择，该选择可以在现实的FL系统中进行特定于客户的选择。我们讨论了如何使用联合选择进行模型培训，并表明它可以导致通信和客户记忆使用情况的急剧减少，从而有可能使模型的训练太大而无法适合处个设备。我们还讨论了联邦选择对隐私和信任的含义，这反过来影响了可能的系统约束和设计。最后，我们讨论有关模型体系结构，隐私保护技术和实用FL系统的开放问题。

translated by 谷歌翻译

Differentiable Collision Detection for a Set of Convex Primitives

Kevin Tracy , Taylor A. Howell , Zachary Manchester

分类：机器人

2022-07-01

物体之间的碰撞检测对于机器人系统的模拟，控制和学习至关重要。但是，现有的碰撞检测例程本质上是非差异的，从而限制了它们在基于优化的算法中的实用性。在这项工作中，我们提出了一个完全可区分的碰撞检测框架，该框架的原因是一组可复合和高度表达的凸原始形状之间的距离。这是通过将碰撞检测问题制定为凸优化问题来实现的，该问题旨在在有相交之前找到要应用于每个对象的最小均匀缩放率。优化问题是完全可区分的，并且能够返回每个对象上的碰撞检测状态以及接触点。

translated by 谷歌翻译

DiffPills: Differentiable Collision Detection for Capsules and Padded Polygons

Kevin Tracy , Taylor A. Howell , Zachary Manchester

分类：机器人

2022-07-01

碰撞检测在机器人系统的模拟，控制和学习中起重要作用。但是，对于对象的配置，没有现有的方法是可区分的，极大地限制了可以在碰撞检测顶部构建的算法。在这项工作中，我们通过将这些问题作为可区分的凸二次程序程序提出，提出了胶囊和填充多边形之间的一组可区分的碰撞检测算法。所得算法能够返回一个接近值，以指示是否发生了碰撞以及对象之间的最接近点，所有对象都是可区分的。结果，它们可以在其他基于梯度的优化方法中可靠地使用，包括轨迹优化，状态估计和强化学习方法。

translated by 谷歌翻译

A Dynamic Data Driven Approach for Explainable Scene Understanding

Zachary A Daniels , Dimitris Metaxas

分类：计算机视觉

2022-06-18

场景理解是计算机视觉领域的一个重要主题，并通过应用于各种领域的应用程序来说明计算挑战，包括遥感，监视，智能农业，机器人技术，自主驾驶和智能城市。我们考虑对场景的主动解释驱动的理解和分类。假设使用一个或多个传感器的代理放置在未知的环境中，并且基于其感觉输入，代理需要将某些标签分配给感知的场景。代理可以调整其传感器以捕获有关场景的其他详细信息，但是与传感器操纵相关的成本，因此，对于代理商而言，以快速有效的方式了解场景很重要。同样重要的是，代理商不仅要了解场景的全局状态（例如，场景的类别或场景中发生的重大事件），而且还了解支持对决定的决策和预测的场景的特征/属性场景的全球状态。最后，当代理遇到未知场景类别时，它必须能够拒绝将标签分配给场景，请求人类的援助，并根据人类提供的反馈更新其基础知识库和机器学习模型。我们介绍了动态数据驱动框架，用于主动解释驱动的场景分类。我们的框架是标题为敏锐的：通过解释驱动的网络主动分类和理解方法。为了展示提出的敏锐方法的实用性，并展示了如何适应特定于域的应用，我们专注于一个示例研究，该案例研究涉及使用具有基于视觉传感器的活性机器人的主动机器人，即，即电光相机。

translated by 谷歌翻译

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aarohi Srivastava , Abhinav Rastogi , Abhishek Rao , Abu Awal Md Shoeb , Abubakar Abid , Adam Fisch , Adam R. Brown , Adam Santoro , Aditya Gupta , Adrià Garriga-Alonso

分类：自然语言处理 | 人工智能 | 机器学习 | (统计)机器学习

2022-06-09

语言模型既展示了定量的改进，又展示了新的定性功能，随着规模的增加。尽管它们具有潜在的变革性影响，但这些新能力的特征却很差。为了为未来的研究提供信息，为破坏性的新模型能力做准备，并改善社会有害的效果，至关重要的是，我们必须了解目前和近乎未来的能力和语言模型的局限性。为了应对这一挑战，我们介绍了超越模仿游戏基准（Big Bench）。 Big Bench目前由204个任务组成，由132家机构的442位作者贡献。任务主题是多样的，从语言学，儿童发展，数学，常识性推理，生物学，物理学，社会偏见，软件开发等等。 Big-Bench专注于被认为超出当前语言模型的功能的任务。我们评估了OpenAI的GPT型号，Google内部密集变压器体系结构和大型基础上的开关稀疏变压器的行为，跨越了数百万到数十亿个参数。此外，一个人类专家评估者团队执行了所有任务，以提供强大的基准。研究结果包括：模型性能和校准都随规模改善，但绝对的术语（以及与评估者的性能相比）；在模型类中的性能非常相似，尽管带有稀疏性。逐渐和预测的任务通常涉及大量知识或记忆成分，而在临界规模上表现出“突破性”行为的任务通常涉及多个步骤或组成部分或脆性指标；社交偏见通常会随着含糊不清的环境而随着规模而增加，但这可以通过提示来改善。

translated by 谷歌翻译